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Abstract —  This  paper  addresses  the  carrier-phase  estimation 
problem  under  low  SNR  conditions  as  are  typical  of  turbo- 
and  LDPC-coded  applications.  In  [1],  [2]  closed-loop  carrier 
synchronization  schemes  for  error-correction  coded  BPSK  and 
QPSK  modulation  were  proposed  that  were  based  on  feeding 
back  hard  data  decisions  at  the  input  of  the  loop,  the  purpose 
being  to  remove  the  modulation  prior  to  attempting  to  track 
the  carrier  phase  as  opposed  to  the  more  conventional  decision- 
feedback  schemes  that  incorporate  such  feedback  inside  the 
loop.  In  this  paper,  we  consider  an  alternative  approach  wherein 
the  soft  information  from  the  iterative  decoder  of  turbo  or 
LDPC  codes  is  instead  used  as  the  feedback. 

I.  Introduction 

In  recent  years  there  has  been  an  ever-increasing  interest 
in  highly  power  efficient  error-correction  codes  such  as 
turbo  codes  and  low  density  parity  check  (LDPC)  codes. 
These  codes  approach  the  Shannon  channel  capacity  of  the 
system  and  operate  at  very  low  symbol  signal-to-noise  ratios 
(SNRs)  thus  necessitating  the  need  for  carrier  synchroniza¬ 
tion  schemes  that  likewise  operate  efficiently  at  these  SNRs. 

A  significant  research  effort  is  underway  in  the  area 
of  joint  decoding  and  carrier  phase  estimation.  As  clearly 
explained  by  Noels  et  al.  [3]  two  somewhat  distinct  groups  of 
joint  decoding  and  synchronization  algorithms  have  evolved. 
The  first  of  these  approach  the  problem  by  modifying  itera¬ 
tive  detection/decoding  algorithms  and/or  graphs  to  include 
parameter  estimation.  A  partial  list  of  work  on  this  approach 
includes  [4]— [9] . 

Of  particular  interest  has  been  the  work  of  Colavolpe  et  al. 
[9]  where  phase-tracking  processing  nodes  were  introduced 
in  the  iterative  decoding  graph.  Dauwels  et  al.  also  inves¬ 
tigated  [7]  specially  adapted  message-passing  update  rules. 
Howard  et  al.  [8]  proposed  a  pilotless  modulation  technique 
for  turbo-coded  differential  8-PSK  modulation  which  uses 
35  iterations  to  compensate  a  7t/8  phase  offset  at  Eh/N0  = 
4.5  dB.  We  also  note  the  work  of  Nuriyev  [6]  who  has 
adapted  density  evolution  to  evaluate  the  performance  of 
joint  carrier-phase  estimation  in  a  pilot-assisted  environment. 
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The  second  group  of  algorithms  pass  messages  between 
an  independent  phase  estimation  block  and  an  essentially 
unmodified  iterative  decoder.  The  resulting  architectures  are 
often  said  to  employ  turbo  synchronization  [3].  Algorithms 
of  this  type  can  can  be  found  in  [10]— [14], 

The  technique  in  this  paper  falls  into  this  second  category 
and  has  the  potentially  attractive  feature  that  little  modifica¬ 
tion  is  required  with  either  the  iterative  decoder  or  the  carrier 
recovery  block  (which  consists  primarily  of  a  phase-locked 
loop  (PLL)).  Specifically,  the  work  leverages  the  fact  that 
LDPC  symbol  estimates  can  ‘wipe-off’  modulated  symbols 
in  a  decision  directed  carrier  recovery  loop  to  enhance  the 
carrier  information  such  that  a  classic  residual  carrier  PLL 
is  able  to  provide  increasingly  accurate  phase  estimates  over 
LDPC  iterations.  The  method  incurs  a  latency  penalty  (by 
way  of  increased  iterations)  as  carrier  phase  is  acquired. 
However,  complexity  in  terms  of  system  description  and  area 
(in  the  case  of  a  real-time  implementation)  remains  similar 
to  that  of  state  of  the  art  residual  carrier  recovery  techniques 
currently  used  for  BPSK  and  QPSK  modulation  in  NASA’s 
deep-space  network. 

The  authors  in  [10]  propose  somewhat  similar  work  but 
have  described  a  phase  estimate  based  on  the  instantaneous 
average  of  an  entire  block  of  received  symbols.  We  also  note 
the  work  of  Lottici  et  al.  [12]  who  developed  a  blind  recovery 
technique  for  QAM  receivers.  The  work  in  this  paper  is  also 
based  on  blind,  or  pilotless,  operation  and  we  motivate  this 
in  part  by  recalling  a  result  from  Anastasopoulos  [4]  who 
showed  pilotless  techniques  to  be  more  efficient  at  lower 
SNRs  where  pilot  insertion  loss  is  considerable. 

The  rest  of  this  paper  is  organized  as  follows.  The  next  sec¬ 
tion  provides  a  detailed  description  of  the  proposed  method. 
In  Section  III,  we  consider  BPSK  modulation  and  derive 
the  tracking  performance  of  the  PLL  in  terms  of  its  mean- 
square  phase  error  when  operating  in  the  linear  (high  loop 
SNR)  region  as  is  typical.  In  Section  IV  we  illustrate  a 
digital  implementation  that  achieves  the  same  performance 
as  the  piecewise  constant  analog  model  considered  in  Section 
III.  Results  for  QPSK  modulation  are  presented  in  Section 
V.  Section  VI  presents  numerical  results  derived  from  a 
simulation  of  the  BPSK  scheme  with  a  particular  LDPC 
code.  Finally,  Section  VII  documents  our  conclusions. 

II.  System  Description 

The  notion  of  iterative  information-reduced  carrier 
synchronization  for  coded  binary  phase-shift-keyed 
(BPSK)  modulation  was  first  introduced  in  [1],  The  term 
“information-reduced”  alludes  to  the  use  of  an  estimate  of 
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the  instantaneous  data  symbol  (and  thus  of  the  instantaneous 
phase  modulation)  to  reduce  the  amount  of  randomness 
(and  thus  the  amount  of  information)  in  the  signal  being 
processed  in  the  carrier  synchronizer. 

Specifically,  in  information-reduced  carrier- 
synchronization  (IRCS),  the  reduction  of  the  amount 
of  information  is  accomplished  by  attempting  to  convert 
the  received  modulated  carrier  to  an  unmodulated  carrier 
(pure  tone)  before  applying  it  to  a  phase-tracking  loop, 
in  the  hope  of  improving  performance.  Traditional  IRCS 
systems  for  synchronization  with  carrier  signals  modulated 
by  BPSK  include  Costas  loops,  data-aided  loops,  and 
demodulation/remodulation  loops.  The  traditional  systems 
are  designed  to  implement  various  approximations  of  a 
closed-loop  structure  that  effects  maximum  a  posteriori 
(MAP)  estimation  of  phase.  The  degradation  of  tracking 
performance  of  such  a  loop  in  the  case  of  BPSK  is 
represented  by  a  quantity  called  the  “squaring  loss”,  which 
is  a  measure  of  the  degradation  of  the  receiver  signal-to- 
noise  (SNR)  ratio  and  is  associated  with  the  mean-squared 
phase  error  of  the  loop.  In  the  case  of  a  conventional 
in-phase/quadrature  (I-Q)  carrier-tracking  loop,  the  mean- 
square  phase  error  is  a  result  of  signal  and  noise  cross 
products  that  are  generated  in  the  effort  to  remove  the  data 
modulation  from  the  loop  error  signal.  At  low  symbol  SNR, 
the  squaring  loss  of  an  I-Q  loop  can  be  severe  enough  to 
prevent  tracking.  Several  publications  based  on  this  notion 
have  appeared  in  the  literature  that  include  everything  from 
the  basic  idea  and  accompanying  analysis/performance 
evaluations  [  1  ]  to  successful  application  and  implementation 
for  specific  block  and  convolutional  codes  [15],  [16]. 

If  the  data  sequence  and  its  timing  were  completely 
known,  then  a  BPSK  signal  could  be  converted  to  a  pure 
tone  simply  by  multiplying  the  BPSK  signal  by  the  data 
waveform.  One  could  then  track  the  unmodulated  carrier  with 
improved  performance  by  use  of  a  phase-locked  loop,  which 
does  not  exhibit  squaring  loss.  Short  of  complete  knowledge 
of  the  data  waveform  and  in  the  presence  of  noise,  the  best 
approximation  of  a  pure  tone  could  be  obtained  by  feeding 
back  an  estimate  of  the  data  waveform  corresponding  to 
tentative  decisions  on  the  data  symbols.  Such  feedback  is 
called  “decision  feedback”. 

Decision  feedback  is  used  within  the  traditional  loops,  but 
is  not  used  to  modify  the  loop  structures.  In  the  proposed 
IRCS  system  (see  Fig.  1),  decision  feedback  is  introduced  at 
the  input  terminal  of  the  loop;  simultaneously,  the  structure 
of  the  loop  would  be  modified  (in  the  sense  that  its  parame¬ 
ters  would  be  modified)  on  the  basis  of  the  associated  change 
in  data-transition  statistics  in  the  input.  In  this  scheme,  the 
input  signal  would  be  converted  to  a  close  approximation  of 
a  pure  tone,  with  a  resultant  improvement  in  carrier-tracking 
performance  over  conventional  I-Q  loops. 

Although  initially  available  data-waveform  estimates  are 
generally  of  low  quality,  they  can  be  used  to  initiate  the 
IRCS  process  by  reducing  the  number  of  data  transitions  at 
the  input.  Once  phase  lock  is  achieved,  the  improved  phase 
estimates  can  be  fed  back  to  the  data  detector,  yielding  im- 
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Fig.  1.  Analog  receiver  with  Information-Reduced  Carrier  Synchronizer 
Using  Soft  Decision  Feedback  for  BPSK 

proved  symbol  estimates  for  feedback,  and  thereby  achieving 
even  better  phase  tracking.  This  iterative  process  eventually 
leads  to  virtual  elimination  of  squaring  loss,  so  that  the 
performance  of  the  system  approaches  that  of  a  phase-locked 
loop  operating  on  an  unmodulated  carrier  signal. 

III.  Tracking  Performance  for  BPSK  Modulation 

The  sample  analog  receiver  shown  in  Fig.  1  will  aid  us 
in  the  derivation  of  the  proposed  carrier  synchronization 
method.  An  alternative  practical  implementation  of  this  re¬ 
ceiver  is  detailed  in  Section  IV.  Consider  an  input  BPSK 
modulation  of  the  form 

?/i (f;  0C)  =  V2Pmn{t)sin  ( wct  +  9C)  +  m(t)  (1) 

where  wc  and  6C  are  the  carrier  frequency  and  phase.  Carrier 
power  P  is  affected  by  a  bandpass  AWGN  process  ni(f)  that 
can  be  expressed  as 

ni(i)  =  x/2  [ Nci(t)cos(wct  +  9C)  —  Nsi(t)sin(wct  +  6C)\) 

(2) 

where  iVcl(i),  Nsl  (t)  have  single-sided  noise  power  spectral 
density  (PSD)  equal  to  N0  and 

OO 

m(t)  =  ^2  dkP  ( t  ~  kTs) 

k=— oo 

is  a  baseband  modulation  with  independent,  identically  dis¬ 
tributed  (i.i.d.)±l  data  symbols  dk  and  unit  rectangular  pulse 
shape  p{t)  of  duration  Ts  or  a  root-raised  cosine  pulse  with 
zero  crossings  a  multiples  of  Ts.  For  simplicity  from  this 
point  forward  we  will  assume  a  rectangular  pulse  shape. 
Under  this  assumption  matched  filters  will  be  equivalent 
to  integrators.  The  next  step  is  to  delay  yi(t.;Oc)  by  the 
decoder  delay  A  and  multiply  it  by  a  normalized  (by  the 
signal  amplitude  A)  1  version  of  the  soft  decision  feedback 
signal  corresponding  to  the  extrinsic  information  derived 
from  decoding  the  LDPC  code.  This  can  be  modeled  as  a 
Gaussian  signal  [17],  [18]  ,  i.e., 

1  "the  purpose  of  the  normalization  is  to  make  the  signal  component  of 
the  feedback  independent  of  the  extrinsic  symbol  information  amplitude, 
A.  This  in  turn  results  in  the  input  to  the  carrier  tracking  loop  being 
independent  of  this  same  amplitude  thereby  not  affecting  the  choice  of  the 
loop  bandwidth.  In  practice,  one  would  need  to  estimate  A  per  iteration  to 
perform  this  normalization.  For  the  purpose  of  our  theoretical  discussion 
here,  we  shall  assume  that  this  estimation  is  perfect. 
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2/2 (t)  =  m(t  -  A)  +  n2(t)/A 

where  n2(t)  can  be  modeled  as  a  piecewise  constant  base¬ 
band  noise  process,  namely, 


n2(t)  =  ^2  n2kP  ( t  -  kTs  -  A) 

k=— oo 

where  n2k  are  i.i.d.  zero  mean  Gaussian  random  variables 
(RVs)  with  variance  a2.  Over  a  single  iteration  (block  of 
input  symbols),  A  and  a2  are  assumed  to  be  fixed.  The 
result  of  this  multiplication,  whose  intent  is  to  remove  the 
modulation,  is 

u{t\9c)  =  yi(t  —  A;  Oc)y2(t) 

=  \/2Psin{  wct  +  9C) 

+{\J  (2  P)/A)m(t  —  A)n2(t)  sin(wct  +  9C) 
+\[2m(t  —  A)[Nci(t)cos(wct  +  9C ) 

- Nsl(t)sin(wct  +  9C)\ 

+{s/2/ A)n2(t)[Ncl{t)cos(wct  +  9C) 

- Nsl(t)sin(wct  +  0C)\ 

(3) 

which  is  the  sum  of  a  pure  sinusoidal  tone  at  the  carrier 
frequency  plus  a  mixture  of  (signal  x  noise)  and  (noise  x 
noise)  terms.  The  signal  in  (3)  is  then  input  to  a  PLL  whose 
voltage-controlled  oscillator  (VCO)  output  can  be  expressed 
as 


rVco(t)  =  V2cos{wct  +  9C )  (4) 

then  we  multiply  (3)  and  (4)  together  and  low-pass  filter  (to 
remove  frequency  components  at  2 wc)  to  obtain 


where 


Nc ifc  =  f£+1)T°  Ncl(t)dt 
Ns ifc  =  j£r)Ts  Nsl(t)dt 


(6) 


are  zero  mean  Gaussian  noise  RVs  with  variance  tjn2  = 
N0TS/ 2.  Clearly  from  the  above,  the  slope  of  the  S'— curve, 
Kg,  is  given  by 


Kg  =TSVP  (7) 

We  now  compute  the  autocorrelation  function  of  v(k,  0C) 
(treated  as  a  piecewise  continuous  process  v(t,  (f>c)  )  from 
which  we  shall  obtain  the  equivalent  noise  PSD  affecting 
the  loop.  For  operation  in  the  neighborhood  of  <j>c  =  0,  it 
is  reasonable  to  consider  only  the  autocorrelation  function 
of  n(/c,0).  Assuming  n\(t)  and  n2(t)  are  independent  and 
the  noise  samples  are  independent  from  symbol  interval  to 
symbol  interval,  then  the  autocorrelation  is  triangular 


Rv(t )  =  E  {v(t,  0 )v(t  +  t,  0)} 


0  otherwise 


with 

=  E{v2(k,0)}=E{N2clk(dk  +  ^)2} 

=  a2n(l  +  a2/A2)  =  ^(l  +  a2/A2) 

Thus,  the  equivalent  single-sided  noise  PSD,  Ne,  is  given  by 


N,  = 


/  — OO 

Rv{t ,  0 )dr  =  N0T2{  1  +  a2 /A2)  (8) 

-OO 


Finally,  the  mean-square  phase  error  in  the  loop  is  given  by 


z(t )  =  \f~Psin{(j)c)  +  (V  P  /  A)m(t  —  A)n2(t)sin(<pc) 

+m(t  -  A)  x  [. Ncl(t)cos((/)c )  -  Nsi(t)sin(<frc)\ 
+  (1  /A)n2(t)  x  [Nci{t)cos(4>c)  -  Nsl(t)sin((f>c)\ 

=  '/Psin(<pc)  +  v(t,  <j>c) 

where  <f>c  =  9C  —  0,  denotes  the  phase  error  in  the  loop.  Note 
that  the  loop  tracks  the  phase  error  (f>c  as  opposed  to  twice 
the  phase  2<br:  error  as  in  the  more  conventional  Costas  loop. 


2  NeBL  NqBl  (  a2  \  a  1 

^  K2  P  \  +  A2 )  PSL 


(9) 


where  Bl  is  the  noise  bandwidth,  p  =  P/(N0Bl)  is  the 
loop  SNR  in  a  conventional  PLL  and 


SL 


A 


(10) 


Next  we  pass  z(t)  through  a  matched  filter  (I&D  in  the 
case  of  square  pulses)  to  produce  (in  the  kth  interval  (k  + 
1)TS  <  t  <  (k  +  2 )TS)  the  piecewise  constant  error  signal  2 


^ k 


IkTs^TS  Z(f)dt  _ 

TaVPsin(^c)  +  (V  P/ A)Tsdkn2ksin{(j>c) 


+dk[Nclkcos{(t)c)  -  Nslksin((j)c)\ 

+(1/ A)n2k[Nclkcos{(l)c)  -  N Slksin{(f>c)\ 
TsyfPsin(4>c)  +  v(k,  tj>c) 


(5) 


2Without  loss  in  generality,  we  herein  ignore  the  decoder  delay  A. 


is  the  degradation  of  the  loop  SNR  analogous  to  the  "squar¬ 
ing  loss”  in  a  conventional  Costas  loop.  The  quantity  A2/ct2 
represents  the  decoder  soft  SNR  estimate.  For  iterative  de¬ 
coding  of  LDPC  and  turbo  codes  [17],  [18],  the  mean  and 
the  variance  are  related  by  the  symmetry  condition  a2  =  2A. 
Using  this  relation  (10)  becomes 

<>« 

As  the  iteration  proceeds,  the  estimated  data  SNR  increases 
and  likewise  the  squaring  loss  decreases  (i.e.,  S[R  ap¬ 
proaches  unity).  By  comparison,  for  a  Costas  loop,  the 
expression  for  the  squaring  loss  is  given  by 
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Fig.  2.  Digital  implementation  of  BPSK  Receiver  with  Information-Reduced  Carrier  Synchronizer  Using  Soft  Decision  Feedback 


SCL  ±  (1  +  1/(2  R,))-1 
Rd  =  P.TS/N0 

and  thus  remains  fixed,  independent  of  the  iteration  process, 
for  a  given  symbol  SNR.  To  numerically  evaluate  the 
performance  in  (11),  one  needs  to  quantify  the  functional 
dependence  of  the  decoder  soft-estimate  of  the  data  SNR 
and  the  input  symbol  SNR. 

IV.  A  Practical  Digital  Implementation 

The  circuit  in  Fig.  2,  shows  a  practical  implementation  of 
the  carrier  recovery  loop  presented  in  the  previous  section. 
Once  again  we  assume  a  rectangular  pulse  shape  although 
a  bandwidth  efficient  Nyquist  pulse-shape  can  also  be  used. 
The  first  difference  with  the  block  diagram  in  Fig.  1  is  that 
demodulation  (convert  to  baseband)  of  the  input  signal  (1) 
is  done  with  the  carrier  phase  error  still  present,  using  the  I 
and  Q  reference  signals  (arbitrarily  assuming  them  to  have 
zero  phase  relative  to  the  received  signal).  We  thus  multiply 
the  input  by  y/2sin{wct)  and  y/2cos{wct),  and  obtain 

x^F{t;  9C )  =  VPm(t)  ( sin  (2 wct  +  9C)  +  sin(9c)) 

+  y/2n\{t)cos{w  ct) 

=  y/Pm{t).  { sin  (2 wct  +  9C)  +  sin{9c )) 

+  Nci(t)cos(2wct  +  9C)  +  Nci(t)cos(9c) 

—  Nsi(t)sin(2wct  +  9C )  —  Nai(t)sin(9c) 

xFF(t;  9C)  =  y/ Pm{t )  (—cos  (2wct  +  9C )  +  cos[9c)) 

+  \/2ni{i)sin{wct) 

=  y/ Pm{t).  {—cos  (2 wct  +  9C)  +  cos{9c )) 

—  Nci{t)sin{2wct  +  9C )  +  Nci{t)sin{9c) 

+  Nsi(t)cos(2wct  +  9C)  —  Nsi(t)cos(9c) 

Applying  a  low-pass  filter  to  remove  frequencies  at  2wc 
yields: 


xc(t\ 9C)  =  y/Pm{t)sin{9c )  +  Nci{t)cos{9c) 
—Nsi(t)sin(9c) 

xs{t;  9C)  =  y/Pm{t)cos{9c)  —  Nci{t)sin{9c ) 

- Nsl{t)cos{9c ) 

The  demodulated  signals  are  then  passed  through  (I&Ds) 
(or  matched  root  raised  cosine  filters)  to  give 

zck  —  yj PTsdkSin{9c)  T  Nc\^cos{9c )  N S\^szn{9 c) 
zsk  =  VPTsdkcos{9c)  -  Nclksin{9c)  -  Nslkcos(9c) 

in  the  interval  (k  +  1)TS  <  t  <  (k  +  2)TS.  Next  multiply 
zck  and  zsk  by  the  normalized  soft  decision  feedback  sample 
(decoder  extrinsic  information) 

2/2/c  =  dk  +  n2k/A  (12) 

where  as  before  over  a  given  iteration  {ri2fc}  are  modeled  as 
i.i.d.  zero  mean  Gaussian  RVs  with  variance  o2.  The  result 
of  this  multiplication  removes  the  modulation  and  produces 

Uck  =  zcky2k  =  y/PTssin{9c)+ 

[{dk  +  n2k/A){Ncikcos{9c)  -  Nsiksin{9c))+ 
n2k/ AyfPTsdk.sin{9c)\ 

=  y/~PTssin{9c)  +  vck, 

usk  =  VPTs.cos{9c)+ 

[{dk  +  n2k/A){-Nclksin{9c)  -  Nslkcos{9c))+ 
n2kJAy[PTsdkcos{9c )] 

=  yj  PTscos{9c)  +  vsk, 

which  is  then  input  to  a  digital  PLL  whose  number-controlled 
oscillator  (NCO)  produces  an  estimate  of  the  carrier  phase 
denoted  by  9C.  Multiplying  uck  and  usk  by  wck  =  cos{9c) 
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Fig.  3.  QPSK  receiver  with  Information-Reduced  Carrier  Phase  Synchro¬ 
nizer  Using  Soft  Decision  Feedback 


and  wsk  =  sin(6c),  respectively,  and  then  differencing  the 
results  of  these  products  provides  the  error  signal 

&k  —  V'ck'Mck  V'sk'Wsk 

=  \'/PTs  (sin(4>c)  +  sin(9c  +  6C)J 
- \y[PTs  sin(<t>c )  +  sin(6c  +  6cfj 
vckcos(9c )  -  vsksin(9c) 

=  \/PTs.sin{(j)c)  +  vckcos(9c )  -  vsksin(9c) 

03) 

where  as  before  <j> c  =  9C  —  9C  denotes  the  phase  error  in  the 
loop.  Comparing  (13)  with  (5)  we  see  that  they  are  identical 
and  thus  the  performance  of  the  digital  PLL  would  also  be 
described  by  (9)  together  with  (10). 

V.  Tracking  Performance  for  QPSK  Modulation 

As  we  did  in  Section  III,  we  now  present  a  sample  receiver 
for  QPSK  signals  shown  in  Fig.  3.  Consider  an  input  QPSK 
modulation  of  the  form 

2/i  (t;  9C)  =  \fPmi  ( t )  sin  (uct  +  9C) 

+y/PrriQ  ( t )  cos  (u)ct  +  9C)  +  m  (t) 

where,  analogous  to  (2),  the  noise  is  modeled  by 

ni  ( t )  =  v/2  [Nci  ( t )  cos  ( Luct  +  9C )  —  Nsi  ( t )  sin  (u>ct  +  9C )] 

and  the  I  and  Q  modulations  are  given  by 

OO 

mi  (t)  =  dikP  (t  -  kTs) 

k—  —  oo 

OO 

mQ  (t)  =  Y*  dQkP  (t  -  kTs) 
k=— oo 

As  before,  multiply  yi  (t ;  9C)  (again  ignoring  the  decoder 
delay)  by  a  normalized  version  of  the  soft  decision  feedback 
signal  corresponding  to  the  extrinsic  information  for  the  I 
data  stream,  namely, 

2/2 /  (t)  =  mi  (■ t )  +  n2i  (t)  /A 


where  n2{T)  is  again  modeled  as  a  piecewise  constant 
baseband  noise  process,  namely, 

OO 

n2i(t)=  ^2  n2ikp  (t  —  kTs)  (14) 

k=— oo 

with  {n2ik}  being  i.i.d.  zero  mean  Gaussian  RVs  with 
variance  a2.  Now  also  phase  shift  the  received  signal  by 
7t/2  rad.  to  form  the  quadrature  input 

2/i  (f;  6C-  7t/2)  =  -  \fPini  (t)  cos  (wct  +  9C) 

+VPmQ  (■ t )  sin  (ujct  +  6C)  +  uiq  (t) 

where 

Uiq  ( t )  =  y/2Ncl  (f)  sin  ( uct  +  9C) 

+V/2Arsi  (t)  cos  (u)ct  +  9C) 

and  multiply  this  by  a  normalized  version  of  the  extrinsic 
information  for  the  Q  data  stream,  namely, 

2/2 q  (f)  =  mQ  ( t )  +  n2Q  (■ t )  /A 

where,  analogous  to  (14), 

OO 

n2Q  (t)  =  ^2  n2QkP  (t  -  kTs) 

k=— oo 

and  the  sequence  {n2Qk}  is  assumed  to  have  the  same  prop¬ 
erties  as  the  sequence  {n2ik}.  Furthermore,  it  is  reasonable 
to  assume  the  two  sequences  independent  of  each  other.  The 
results  of  the  above-mentioned  multiplications  are  given  by 

1 1/  (t ;  9C)  =  \[P  sin  {ojct  +  9C) 

+y/Pmi  (t)  mQ  (f)  cos  ( uict  +  9C ) 

+  Up /A)  mi  (t)  n2i  (t)  sin  (c uct  +  9C) 

+  Up /A)  mQ  (t)  n2i  (t)  cos  (ujct  +  9C) 

+V2mi  ( t )  [7Vcl  (t)  cos  (ujct  +  9C) 

-Nsi  ( t )  sin  (u>ct  +  9C)\ 

+  (V2 /A)  n2i  (; t )  [iVcl  (f)  cos  (u>ct  +  9C) 
-Ns i  (f)  sin  (u>ct  +  6C)} 

uq  (f ;  9C )  =  y/P  sin  (u>ct  +  6C) 

—  y/Pmi  (f)  mQ  (t)  cos  {wct  +  6C) 

Vp/a )  mi  (■ t )  n2Q  (t)  cos  (ioct  +  6C) 
Vp/a)  mQ  ( t )  n2Q  (t)  sin  (uct  +  6C) 

+a/2 mQ  (t)  [Nci  (t)  sin  (uct  +  6C) 

+NS  i  (t)  cos  (uct  +  9C)] 

+  (y/2/A)  n2Q  (t)  [Nci  (t)  sin  (wct  +  6C) 
+Nsl  (f)  cos  (uct  +  6C)} 

Adding  Ui(t:  6C)  and  UQ(t;9c)  eliminates  the  cross¬ 
modulation  signal  term  and  produces  a  signal  that  is  again 
the  sum  of  a  pure  sinusoidal  tone  at  the  carrier  frequency 
plus  a  mixture  of  (signal x noise)  and  (noise x  noise)  terms. 

The  signal  u(t;  9C)  is  input  to  a  PLL  which  after  demodula- 
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tion  by  the  reference  signal  in  (4)  gives 

z  (t)  =  -s/^Psin  (<f>c) 

+  (^\/ P/2/A^j  [to/  (t)  ri2i  (t)  +  rriQ  (f)  u2q  (t)]  sin  {<j>c) 

+  (^/P/2/Asj  [itiq  (t)  n2i  (f)  -  mi  (t)  n2Q  (f)]  cos  (</c) 

+  [to/  (t)  Ncl  (t)  +  mQ  (f)  Nsi  (/)]  cos  (</>c) 

+  [toq  (t)  Ncl  ( t )  -to/  (t)  iVsi  (f)]  sin  (</)c) 

+  (1/A)  [n2/  ( t )  A/i  (f)  +  n2Q  (f)  1Vsi  (f)]  cos  (<j>c) 

+  (1/A)  [ n2Q  (t)  Nci  (t)  -  n2I  ( t )  iVsi  (f)]  sin  (</>c) 

As  in  the  BPSK  case,  next  we  pass  z(t)  through  a  matched 
filter  to  produce  (in  the  kth  interval  )  (k  +  1)  Ts  <  t  < 
(k  +  2)  Ts  the  piecewise  constant  error  signal 

r(k+l)Ta 

ek  =  z  ( t )  dt  =  TS\/2P  sin  (</>c) 

J  kTs 

+  (VP/2/A)  Ts  [dIkn2Ik  +dQkn2Qk }  sin  (cj>c) 

+  (\/ -P/2/A^j  Ts  [dQferi2/fe  —  dikn2Qk]  cos  (cj)c) 

+  [ dikNclk  +  dQkNslk)  cos  (</>c)  ’  ! ' : ' 

[dQkNcik  djkNs i&]  sin  (/>c) 

+  (1/A)  [n2/fclVclfc  +  n2QfcATgife]  cos  (</c) 

+  (1/A)  [?r2Qfe-/Vcifc  — n2/fclVsifc]  sin  (<j>c) 

=  Ts  V2P  sin  ((f>c)  +  v  (k,  </>c) 
where  Ncik,Nsjk  are  dehned  in  (6). 

Once  again  we  must  compute  the  slope  of  the  S-curve  and 
the  equivalent  noise  PSD.  From  (15),  the  slope  of  the  S-curve 
is  immediately  given  by 

Kg  =  TSV2P.  (16) 

The  variance  of  the  additive  noise  v(k.  0)  is  readily  deter¬ 
mined  to  be  3 

a2v  =  N0TS  [1  +  (1  +  Rd)  a2 /A2]  (17) 

and  thus  from  (8),  the  equivalent  noise  PSD  is 

Ne  =  2 Tsa2v  =  2 N0T2  [1  +  (1  +  Rd)  a2 /A2]  (18) 

Finally,  applying  (16)  and  (17)  to  (9),  the  mean-square  phase 
error  in  the  loop  becomes 


d  = 


NeBL 

K2 

9 


NoBl 

p 


1  +  (1  +  Rd) 


A2 


A  1 

~pSp 


or  equivalently,  the  “squaring  loss”  is  given  by 

S[R=  [1  +  (1  + Rd)  a2 /A2]'1 
which  applying  A /a2  =  1/2  simplifies  to 


SILR  =  [1  +  (1  +  Rd)(2/A)} 


-1 


(19) 


3The  additional  noise  variance  factor  (1  +  II, i )  comes  from  the  presence 
of  a  quadrature  signal  X  noise  term  that  is  absent  in  the  BPSK  case. 


Comparing  (19)  with  (11)  we  immediately  observe  the 
additional  penalty  (dependent  now  on  symbol  SNR:  Rd) 
in  carrier  tracking  performance  using  a  PLL  for  QPSK 
relative  to  BPSK.  For  small  symbol  SNRs,  this  penalty 
becomes  insignificant.  Furthermore,  note  that  because  of 
the  creation  of  a  pure  tone  by  the  soft  decision  feedback, 
thus  allowing  the  use  of  a  PLL,  there  are  no  fourth  order 
(signal  x  noise)  or  (noise  x  noise)  products  in  the  loop  as  in 
the  conventional  QPSK  Costas  loop  or  information-reduced 
carrier  synchronization  loop  with  hard  decision  feedback. 
Thus,  the  “squaring  loss”  penalty  is  inherently  smaller  than 
the  “quadrupling  loss”  penalty  associated  with  the  above- 
mentioned  loops. 

As  was  the  case  for  BPSK,  it  is  also  possible  to  construct 
an  alternative  digital  implementation  that  once  again  would 
yield  the  same  performance  as  its  piecewise  continuous 
analog  counterpart  discussed  above. 

VI.  Iterative  Processing  and  Numerical  Results 

We  have  evaluated  the  performance  of  the  all-digital  BPSK 
baseband  approach  described  in  Section  IV  via  joint  decod¬ 
ing  with  an  rate-1/2  (1944,  972)  irregular  low-density  parity- 
check  (LDPC)  code  developed  in  [19].  After  a  complex 
rotation  to  resolve  phase  ambiguity  (discussed  below),  the 
signals  zck  and  zsk  are  multiplied  by  the  decoder  output 
y2k  to  form  uck  and  usk.  As  described  in  previous  sections 
and  shown  in  [1],  if  the  PLL  input  has  a  small  fraction  of 
total  modulated  symbols  in  a  block  successfully  removed 
then  it  can  begin  to  produce  a  reasonable  phase  estimate, 
even  at  relatively  low  SNRs.  We  have  found  that  the  es¬ 
timation/decoding  process  can  be  successfully  started  by 
assigning  y-2k  =  zsk  (Subsequent  iterations  derive  y2k  from 
the  decoder  as  described  by  (12)).  After  this  assignment, 
the  PLL  operates  once  across  all  symbols  in  a  codeword. 
LDPC  decoder  log-likelihood  ratio  inputs  are  then  produced 
by  combining  the  updated  PLL  phase  estimates  with  zck  and 

Zsk 


Qk  —  a2  ( zskwck  4-  zckwsk) 

=  =T-  (yPTsdkcos{4>c)  -  Nciksin((j>c)  -  N slkcos{(j)c)^ 

where  aflr  =  PT2 /(2ES/N0). 

An  “extrinsic”  LLR  feedback  mechanism  was  employed 
in  which  prior  LDPC  inputs  are  subtracted  from  current 
outputs  before  new  inputs  (from  the  most  recent  PLL  up¬ 
date)  are  added.  Also,  state  information  in  the  decoder  (in 
particular  the  most  recent  extrinsic  information  arriving  from 
check-nodes)  is  preserved  between  LDPC-to-PLL-to-LDPC 
iterates.  The  accumulator  in  Fig.  2  implements  the  first  order 
transfer  function 


H(z) 


Kv  +  Kiz-1 
1-z-1 


where  Kp  =  10-6  and  K,  =  10~s  were  selected. 

The  bit  and  frame  error  rate  performance  of  the  system 
are  shown  in  Fig.  4  for  different  update  schedules  between 


451 


WIIC.230 


(a) 

Fig.  4.  IRCS  BER/FER  performance  with  a 

the  decoder  and  the  PLL  circuit  and  for  two  cases  of  phase 
error  <bc.  Simulations  with  <pc  -  0  and  (!),■  —  7r/4  represent 
cases  of  minimum  and  maximum  initial  phase  error.  Cases 
with  (f>c  —  0  are  not  shown  in  the  figure  since  they  always 
achieve  the  same  performance  as  the  stand-alone  code.  All 
other  initial  phase  offsets  have  error  rate  performance  that 
lies  between  these  two  cases.  The  total  number  of  LDPC 
iterations  was  set  to  either  20  or  50.  The  set  of  curves  labeled 
(20-20)  and  (50-50)  shows  the  error  rate  performance  when  a 
decoder  iteration  is  followed  by  a  PLL  update.  An  alternative 
schedule  (20-10)  and  (50-25)  where  a  PLL  update  is  done 
after  two  decoder  iterations  is  also  shown. 

A  plot  of  loop  SNR,  l/fj^  (the  inverse  of  the  measured 
average  squared  estimation  error),  as  a  function  of  the 
number  of  iterations  where  (f>c  —  0  and  (Pc  =  7t/4  is 
shown  in  Fig.  5  in  conjunction  with  (50-25)  scheduling  (up 
to  50  LDPC  iterations,  with  PLL  updates  on  every  other 
iteration).  Fig.  5(b)  explicitly  shows  the  difference  in  loop 
SNR  between  the  two  cases  in  Fig.  5(a).  Note  that  steady 
state  operation  is  reached  after  40  iterations  when  <f>c  =  0  and 
SNR  varies  between  1  and  2  dB.  Up  to  10  more  iterations 
are  required  when  the  initial  error  is  (pc  —  7t/4.  Note  that 
even  with  these  additional  iterations,  the  loop  SNR  does  not 
quite  reach  the  levels  observed  in  the  in  (f>c  =  0  case.  This 
marginal  degradation  in  loop-SNR  explains  the  small  (<  0.1 
dB)  performance  loss  in  BER/FER  performance  in  Fig.  4(b). 
Fig.  5  also  shows  that  in  the  case  of  only  20  iterations  (for 
either  initial  phase  offset)  the  PLL  has  not  reached  steady 
state.  The  associated  loss  BER/FER  in  performance  is  shown 
in  Fig.  4(a)  where  at  a  frame  error  rate  of  10~4  the  (20-10) 
schedule  loses  0.15  dB  and  the  (20-20)  schedule  loses  0.07 
dB. 

Note  the  curious  performance  in  Fig.  5(a)  of  the  zero- 
phase  offset  case  during  the  first  10  LDPC  iterations.  In  this 


Eb  \  No  [dB] 

(b) 


region  the  PLL’s  phase  estimate  (which  is  initially  correct)  is 
highly  noised  by  poor  initial  LDPC  soft-symbol  estimates. 
Both  LDPC  soft-symbol  estimates  and  loop-SNR  begin  to 
improve  dramatically  after  the  10th  iteration. 

We  conclude  this  section  by  noting  that  phase  ambiguity 
(for  offsets  greater  than  ±7r/2  can  be  resolved  by  first 
measuring  the  average  power  across  a  single  codeblock  of 
the  signals  zc  and  zs.  If  the  sine  component  (zc)  has  average 
power  greater  than  the  cosine  component  (,zs),  then  these 
two  components  are  swapped.  This  procedure  may  leave 
(or  induce)  a  remaining  error  of  it  radians.  To  resolve  this 
ambiguity  we  run  a  single  PLL  pass  followed  by  several 
(up  to  4)  LDPC  iterations.  The  orientation  that  produces  the 
maximum  number  of  satisfied  odd-degree  check  equations  is 
selected  and  the  decoding  procedure  is  reinitialized  4.  Similar 
techniques  are  proposed  in  [10],  [11]. 

VII.  Conclusions 

We  have  demonstrated  a  means  for  improving  the  carrier 
synchronization  function  for  iterative  decoded  BPSK  using 
information  derived  from  the  decoder  extrinsics  to  remove 
the  modulation  (information-reduction)  prior  to  the  carrier 
tracking  operation.  The  motivation  for  doing  this  is  to 
overcome  the  penalty  in  noisy  reference  loss  attributed  to 
the  large  squaring  loss  at  low  SNRs  that  is  characteristic 
of  the  traditional  BPSK  carrier  sync  loops  such  as  the 
Costas-type  loop.  In  comparison  to  the  information-reduced 
carrier  synchronization  loop  with  hard  decision  feedback  as 
proposed  in  [1]  and  [2],  the  scheme  described  in  this  work 
makes  use  of  soft  decision  extrinsic  information  and  does  not 
require  estimating  the  decoder  error  probability.  This  occurs 
as  a  consequence  of  the  assumption  here  of  a  fixed  carrier 
synchronization  structure,  i.e.,  a  PLL,  whose  design  does  not 

4Even  degree  checks  remain  satisfied  under  a  rotation  of  all  inputs  by  ^ r. 
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Fig.  5.  IRCS  Loop  SNR  Performance. 


change  with  knowledge  obtained  from  the  decoder.  While 
in  the  soft  decision  feedback  case  considered  here  such  a 
structure  would  only  be  asymptotically  optimum  (in  the  MAP 
motivation  sense)  at  high  SNR,  it  nevertheless  provides  a 
simple  yet  performance  efficient  carrier  synchronization  loop 
in  SNR  regions  of  interest  for  coded  applications. 
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